Performance Analysis of Data Reduction Algorithms Using Attribute Selection in Nsl-kdd Dataset

نویسندگان

  • Meenu Choudhary
  • Vikas Choudhary
چکیده

Abstract—Data sets – both structured & unstructured, which are so large and complex that processing it using database management tools or traditional applications to derive analytical insights becomes difficult. Capturing, storage, internal search, sharing, predictive analysis and visualization are some of the major tasks being performed on such data sets. With the increa sing amount of data generated by social sharing platforms & apps, the process of data reduction has become inevitable. It involves compressing the data being generated & storing it in a data storage environment. In computer networks, these techniques have played a pivot role in increasing storage efficiency and reduced computatio nal costs. In this paper, data reduction algorithms have been applied on NSL-KDD dataset. The output of data reduction algorithm is given as an input to two classification algorithms i.e. PART and Random Forest. The aim is to find out which data reduction technique proves to be useful in enhancing the performance of the classification algorithm. Performance is compared on factors like precision, sensitivity and accuracy.Data sets – both structured & unstructured, which are so large and complex that processing it using database management tools or traditional applications to derive analytical insights becomes difficult. Capturing, storage, internal search, sharing, predictive analysis and visualization are some of the major tasks being performed on such data sets. With the increa sing amount of data generated by social sharing platforms & apps, the process of data reduction has become inevitable. It involves compressing the data being generated & storing it in a data storage environment. In computer networks, these techniques have played a pivot role in increasing storage efficiency and reduced computatio nal costs. In this paper, data reduction algorithms have been applied on NSL-KDD dataset. The output of data reduction algorithm is given as an input to two classification algorithms i.e. PART and Random Forest. The aim is to find out which data reduction technique proves to be useful in enhancing the performance of the classification algorithm. Performance is compared on factors like precision, sensitivity and accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

راهکار ترکیبی نوین جهت تشخیص نفوذ در شبکه‌های کامپیوتری با استفاده از الگوریتم-های هوش محاسباتی

In this paper, a novel hybrid method is proposed for intrusion detection in computer networks using combination of misuse-based and anomaly-based detection models with the aim of performance improvement. In the proposed hybrid approach, a set of algorithms and models is employed. The selection of input features is performed using shuffled frog-leaping (SFL) algorithm. The misuse detection modul...

متن کامل

Feature Selection and Classification of Intrusion Detection System Using Rough Set

With the expansion of computer network there is a challenge to compete with the intruders who can easily break into the system. So it becomes a necessity to device systems or algorithms that can not only detect intrusion but can also improve the detection rate. In this paper we propose an intrusion detection system that uses rough set theory for feature selection, which is extraction of relevan...

متن کامل

Network Intrusion Detection Using Hybrid Simplified Swarm Optimization and Random Forest Algorithm on Nsl-Kdd Dataset

During the last decade the analysis of intrusion detection has become very significant, the researcher focuses on various dataset to improve system accuracy and to reduce false positive rate based on DAPRA 98 and later the updated version as KDD cup 99 dataset which shows some statistical issues, it degrades the evaluation of anomaly detection that affects the performance of the security analys...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014